A Modified Reinforcement Learning Scheme For Controlling The Mean Arterial Pressure
Abstract
The original reinforcement learning scheme comprises two networks: one acts as a controller and the other as an evaluator. Using temporal difference prediction techniques, the evaluator network predicts an external reinforcement signal and estimates a more informative internal signal, which is used to adapt a set of controller parameters. This paper introduces a modified reinforcement learning scheme that simplifies the original one. The main idea of the proposed scheme is to replace the fuzzy neural network (FNN) evaluator with a simple evaluator that depends directly on the environment of the process being controlled. Based on a performance index, the proposed evaluator outputs a reinforcement signal in the range [-1, 1] using one of two methods: one is fuzzy, and the other uses a discrete, uniform set of reinforcement signals. Compared with the original reinforcement scheme, the proposed scheme has a relatively low computational demand, which makes it suitable for controlling real-time intensive processes. The main features of the proposed scheme are reflected in our simulation results on controlling the mean arterial pressure system.
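The abstract does not give the performance index or the exact form of the two evaluators, so the following Python sketch is only an illustration of what such an evaluator might look like, not the authors' method. It assumes the performance index is the absolute error between the measured mean arterial pressure and its set-point (in mmHg), normalised by a hypothetical `max_error`. The function names, the number of levels, and the triangular membership functions are all assumptions made for this example.

```python
def normalised_index(error, max_error=20.0):
    """Assumed performance index: |error| scaled to [0, 1] and clipped."""
    return min(abs(error) / max_error, 1.0)


def discrete_reinforcement(error, max_error=20.0, levels=5):
    """Discrete evaluator (assumed form): pick one of `levels` uniformly
    spaced signals in [-1, 1]. Small error gives +1, large error gives -1."""
    index = normalised_index(error, max_error)
    # Uniformly spaced signals from +1 down to -1, e.g. 1, 0.5, 0, -0.5, -1.
    signals = [1.0 - 2.0 * k / (levels - 1) for k in range(levels)]
    # Select the bin that the normalised index falls into.
    bin_id = min(int(index * levels), levels - 1)
    return signals[bin_id]


def triangular(x, left, peak, right):
    """Triangular membership function with its apex at `peak`."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


def fuzzy_reinforcement(error, max_error=20.0):
    """Fuzzy evaluator (assumed form): three fuzzy sets over the index
    (small, medium, large error) with a weighted-average defuzzification."""
    index = normalised_index(error, max_error)
    memberships = [
        triangular(index, -0.5, 0.0, 0.5),  # small error  -> reward  (+1)
        triangular(index, 0.0, 0.5, 1.0),   # medium error -> neutral (0)
        triangular(index, 0.5, 1.0, 1.5),   # large error  -> penalty (-1)
    ]
    outputs = [1.0, 0.0, -1.0]
    total = sum(memberships)
    return sum(m * r for m, r in zip(memberships, outputs)) / total
```

Both functions involve only a few arithmetic operations per control step, which is consistent with the abstract's claim of low computational demand compared with training a separate FNN evaluator. With these particular membership functions the fuzzy output varies continuously with the error, whereas the discrete evaluator changes in steps.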
Related resources
Controlling chaos by GA-based reinforcement learning neural network
This paper proposes a TD (temporal difference) and GA (genetic algorithm) based reinforcement (TDGAR) neural learning scheme for controlling chaotic dynamical systems based on the technique of small perturbations. The TDGAR learning scheme is a new hybrid GA, which integrates the TD prediction method and the GA to fulfill the reinforcement learning task. Structurally, the TDGAR learning system i...
A Threshold-based Model of Reinforcement Learning
A generic and scalable Reinforcement Learning scheme for Artificial Neural Networks is presented, providing a general-purpose learning machine. By reference to a node threshold, three features are described: 1) a mechanism for Primary Reinforcement, capable of solving linearly inseparable problems; 2) the learning scheme is extended to include a mechanism for Conditioned Reinforcement, capable of ...
Estimation of mean arterial pressure by invasive arterial pressure monitoring with a manometer
Background: Direct monitoring of arterial pressure using a transducer system is not affordable in most operating rooms and ICU wards in Iran. It is, however, possible to use an aneroid manometer instead, but it is not standardized yet, nor studied enough and its measurements may not be interpretable. Methods: To study the correlation of the arterial pressure readings between a manometer and a ...
Survey of effective factors on learning motivation of clinical students and suggesting the appropriate methods for reinforcement the learning motivation from the viewpoints of nursing and midwifery faculty, Tabriz University of Medical Sciences 2002.
Introduction. Motivation is a powerful force in the teaching-learning process, so that even the richest and best-structured training plans are ineffective when motivation is lacking. Because a teacher's success depends on students' motivation to learn, teachers need to know effective methods for motivating students and t...
RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
The principal aim of a search engine is to provide results sorted according to the user's requirements. To achieve this aim, it employs ranking methods to rank web documents based on their significance and relevance to the user query. The novelty of this paper is a user-feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...